Arabic tweeps dialect prediction based on machine learning approach
نویسندگان
چکیده
In this paper, we present our approach for profiling Arabic authors on twitter, based their tweets. We consider here the dialect of an author as important trait to be predicted. For purpose, many indicators, feature vectors and machine learning-based classifiers were implemented. The results these compared find out best prediction model. model was obtained using random forest classifier with full forms stems vector.
منابع مشابه
Arabic Tweeps Gender and Dialect Prediction
In this paper, we present our approach for author profiling task based on Arabic content (Twitter case), which was one of the tasks required in PAN at CLEF 2017. Author profiling is the process of identifying authors’ traits, which constitute the profile of an author, by analysing his/her writings. In our research, we considered the gender and the variety (dialect) of an author as two important...
متن کاملArabic Dialect Handling in Hybrid Machine Translation
In this paper, we describe an extension to a hybrid machine translation system for handling dialect Arabic, using a decoding algorithm to normalize non-standard, spontaneous and dialectal Arabic into Modern Standard Arabic. We prove the feasibility of the approach by measuring and comparing machine translation results in terms of BLEU with and without the proposed approach. We show in our tests...
متن کاملMachine Translation Experiments on PADIC: A Parallel Arabic DIalect Corpus
We present in this paper PADIC, a Parallel Arabic DIalect Corpus we built from scratch, then we conducted experiments on crossdialect Arabic machine translation. PADIC is composed of dialects from both the Maghreb and the Middle-East. Each dialect has been aligned with Modern Standard Arabic (MSA). Three dialects from Maghreb are concerned by this study: two from Algeria, one from Tunisia, and ...
متن کاملDomain and Dialect Adaptation for Machine Translation into Egyptian Arabic
In this paper, we present a statistical machine translation system for English to Dialectal Arabic (DA), using Modern Standard Arabic (MSA) as a pivot. We create a core system to translate from English to MSA using a large bilingual parallel corpus. Then, we design two separate pathways for translation from MSA into DA: a two-step domain and dialect adaptation system and a one-step simultaneous...
متن کاملArabic to French Machine Translation System based on DCF Approach
Machine translation is one of the disciplines that cover the automatic language processing domain. There are two different approaches to realize a machine translation system; linguistic approach, based on a set of theories and rules that govern the processed language and statistical one, based on probabilities and mathematic theories. Our system emerges from the linguistic school and contains i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Electrical and Computer Engineering
سال: 2021
ISSN: ['2088-8708']
DOI: https://doi.org/10.11591/ijece.v11i2.pp1627-1633